Binary Neural Networks Algorithms, Architectures, and Applications (Baochang Zhang, Sheng Xu, Mingbao Lin etc.)

BONN: Bayesian Optimized Binary Neural Network

FIGURE 3.21

The images on the left are the input images chosen from the ImageNet ILSVRC12 dataset.

Right images are feature maps and binary feature maps from diﬀerent layers of BONNs.

The ﬁrst and third rows are feature maps for each group, while the second and fourth rows

are corresponding binary feature maps. Although binarization of the feature map causes

information loss, BONNs could extract essential features for accurate classiﬁcation.

Weight Distribution Figure 3.23 further illustrates the distribution of the kernel weights,

with λ ﬁxed to 1e −4. During the training process, the distribution gradually approaches

the two-mode GMM, as assumed previously, conﬁrming the eﬀectiveness of the Bayesian

kernel loss in a more intuitive way. We also compare the kernel weight distribution between

XNOR-Net and BONN. As shown in Fig. 3.24, the kernel weights learned in XNOR-Net

are tightly distributed around the threshold value, but those in BONN are regularized in a

Epoch

Accuracy

Top-1 on ImageNet

BONN-Train

BONN-Test

XNOR-Train

XNOR-Test

Epoch

Accuracy

Top-5 on ImageNet

BONN-Train

BONN-Test

XNOR-Train

XNOR-Test

FIGURE 3.22

Training and test accuracies on ImageNet when λ = 1e −4 shows the superiority of the

proposed BONN over XNOR-Net. The backbone of the two networks is ResNet-18.